Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 8693 |
| Missing cells | 1400 |
| Missing cells (%) | 1.0% |
| Duplicate rows | 13 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 1.3 MiB |
| Average record size in memory | 159.4 B |
Variable types
| Categorical | 4 |
|---|---|
| Boolean | 3 |
| Numeric | 9 |
| Dataset has 13 (0.1%) duplicate rows | Duplicates |
VIP is highly imbalanced (84.0%) | Imbalance |
HomePlanet has 201 (2.3%) missing values | Missing |
CryoSleep has 217 (2.5%) missing values | Missing |
Destination has 182 (2.1%) missing values | Missing |
VIP has 203 (2.3%) missing values | Missing |
Cabin_Deck has 199 (2.3%) missing values | Missing |
Cabin_Number has 199 (2.3%) missing values | Missing |
Cabin_Side has 199 (2.3%) missing values | Missing |
Age has 178 (2.0%) zeros | Zeros |
RoomService has 5758 (66.2%) zeros | Zeros |
FoodCourt has 5639 (64.9%) zeros | Zeros |
ShoppingMall has 5795 (66.7%) zeros | Zeros |
Spa has 5507 (63.3%) zeros | Zeros |
VRDeck has 5683 (65.4%) zeros | Zeros |
Total_Spend has 3653 (42.0%) zeros | Zeros |
Reproduction
| Analysis started | 2024-03-05 09:28:09.778264 |
|---|---|
| Analysis finished | 2024-03-05 09:33:43.074418 |
| Duration | 5 minutes and 33.3 seconds |
| Software version | ydata-profiling vv4.6.5 |
| Download configuration | config.json |
HomePlanet
Categorical
MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 201 |
| Missing (%) | 2.3% |
| Memory size | 393.9 KiB |
| Earth | |
|---|---|
| Europa | |
| Mars |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.0438059 |
| Min length | 4 |
Characters and Unicode
| Total characters | 42832 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Europa |
|---|---|
| 2nd row | Earth |
| 3rd row | Europa |
| 4th row | Europa |
| 5th row | Earth |
Common Values
| Value | Count | Frequency (%) |
| Earth | 4602 | |
| Europa | 2131 | |
| Mars | 1759 | 20.2% |
| (Missing) | 201 | 2.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| earth | 4602 | |
| europa | 2131 | |
| mars | 1759 | 20.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8492 | |
| r | 8492 | |
| E | 6733 | |
| t | 4602 | |
| h | 4602 | |
| u | 2131 | 5.0% |
| o | 2131 | 5.0% |
| p | 2131 | 5.0% |
| M | 1759 | 4.1% |
| s | 1759 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34340 | |
| Uppercase Letter | 8492 | 19.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8492 | |
| r | 8492 | |
| t | 4602 | |
| h | 4602 | |
| u | 2131 | 6.2% |
| o | 2131 | 6.2% |
| p | 2131 | 6.2% |
| s | 1759 | 5.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 6733 | |
| M | 1759 | 20.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 42832 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8492 | |
| r | 8492 | |
| E | 6733 | |
| t | 4602 | |
| h | 4602 | |
| u | 2131 | 5.0% |
| o | 2131 | 5.0% |
| p | 2131 | 5.0% |
| M | 1759 | 4.1% |
| s | 1759 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42832 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8492 | |
| r | 8492 | |
| E | 6733 | |
| t | 4602 | |
| h | 4602 | |
| u | 2131 | 5.0% |
| o | 2131 | 5.0% |
| p | 2131 | 5.0% |
| M | 1759 | 4.1% |
| s | 1759 | 4.1% |
CryoSleep
Boolean
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 217 |
| Missing (%) | 2.5% |
| Memory size | 393.9 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 217 |
| Value | Count | Frequency (%) |
| False | 5439 | |
| True | 3037 | |
| (Missing) | 217 | 2.5% |
Destination
Categorical
MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 182 |
| Missing (%) | 2.1% |
| Memory size | 393.9 KiB |
| TRAPPIST-1e | |
|---|---|
| 55 Cancri e | |
| PSO J318.5-22 |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 11.187052 |
| Min length | 11 |
Characters and Unicode
| Total characters | 95213 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TRAPPIST-1e |
|---|---|
| 2nd row | TRAPPIST-1e |
| 3rd row | TRAPPIST-1e |
| 4th row | TRAPPIST-1e |
| 5th row | TRAPPIST-1e |
Common Values
| Value | Count | Frequency (%) |
| TRAPPIST-1e | 5915 | |
| 55 Cancri e | 1800 | 20.7% |
| PSO J318.5-22 | 796 | 9.2% |
| (Missing) | 182 | 2.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| trappist-1e | 5915 | |
| 55 | 1800 | 13.9% |
| cancri | 1800 | 13.9% |
| e | 1800 | 13.9% |
| pso | 796 | 6.2% |
| j318.5-22 | 796 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 12626 | |
| T | 11830 | |
| e | 7715 | 8.1% |
| S | 6711 | 7.0% |
| - | 6711 | 7.0% |
| 1 | 6711 | 7.0% |
| A | 5915 | 6.2% |
| I | 5915 | 6.2% |
| R | 5915 | 6.2% |
| 5 | 4396 | 4.6% |
| Other values (13) | 20768 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 52304 | |
| Lowercase Letter | 16715 | 17.6% |
| Decimal Number | 14291 | 15.0% |
| Dash Punctuation | 6711 | 7.0% |
| Space Separator | 4396 | 4.6% |
| Other Punctuation | 796 | 0.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 12626 | |
| T | 11830 | |
| S | 6711 | |
| A | 5915 | |
| I | 5915 | |
| R | 5915 | |
| C | 1800 | 3.4% |
| O | 796 | 1.5% |
| J | 796 | 1.5% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 7715 | |
| c | 1800 | 10.8% |
| i | 1800 | 10.8% |
| r | 1800 | 10.8% |
| n | 1800 | 10.8% |
| a | 1800 | 10.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6711 | |
| 5 | 4396 | |
| 2 | 1592 | 11.1% |
| 3 | 796 | 5.6% |
| 8 | 796 | 5.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6711 |
Space Separator
| Value | Count | Frequency (%) |
| 4396 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 796 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 69019 | |
| Common | 26194 | 27.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 12626 | |
| T | 11830 | |
| e | 7715 | |
| S | 6711 | |
| A | 5915 | |
| I | 5915 | |
| R | 5915 | |
| c | 1800 | 2.6% |
| i | 1800 | 2.6% |
| r | 1800 | 2.6% |
| Other values (5) | 6992 |
Common
| Value | Count | Frequency (%) |
| - | 6711 | |
| 1 | 6711 | |
| 5 | 4396 | |
| 4396 | ||
| 2 | 1592 | 6.1% |
| 3 | 796 | 3.0% |
| 8 | 796 | 3.0% |
| . | 796 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 95213 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 12626 | |
| T | 11830 | |
| e | 7715 | 8.1% |
| S | 6711 | 7.0% |
| - | 6711 | 7.0% |
| 1 | 6711 | 7.0% |
| A | 5915 | 6.2% |
| I | 5915 | 6.2% |
| R | 5915 | 6.2% |
| 5 | 4396 | 4.6% |
| Other values (13) | 20768 |
Age
Real number (ℝ)
ZEROS 
| Distinct | 81 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.82793 |
| Minimum | 0 |
|---|---|
| Maximum | 79 |
| Zeros | 178 |
| Zeros (%) | 2.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 393.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 20 |
| median | 27 |
| Q3 | 37 |
| 95-th percentile | 55 |
| Maximum | 79 |
| Range | 79 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 14.339054 |
|---|---|
| Coefficient of variation (CV) | 0.49740145 |
| Kurtosis | 0.16715426 |
| Mean | 28.82793 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.42347771 |
| Sum | 250601.2 |
| Variance | 205.60848 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 324 | 3.7% |
| 18 | 320 | 3.7% |
| 21 | 311 | 3.6% |
| 19 | 293 | 3.4% |
| 23 | 292 | 3.4% |
| 22 | 291 | 3.3% |
| 20 | 277 | 3.2% |
| 26 | 268 | 3.1% |
| 28 | 267 | 3.1% |
| 27 | 259 | 3.0% |
| Other values (71) | 5791 |
| Value | Count | Frequency (%) |
| 0 | 178 | |
| 1 | 67 | 0.8% |
| 2 | 75 | |
| 3 | 75 | |
| 4 | 71 | 0.8% |
| 5 | 33 | 0.4% |
| 6 | 40 | 0.5% |
| 7 | 52 | 0.6% |
| 8 | 46 | 0.5% |
| 9 | 42 | 0.5% |
| Value | Count | Frequency (%) |
| 79 | 3 | < 0.1% |
| 78 | 3 | < 0.1% |
| 77 | 2 | < 0.1% |
| 76 | 2 | < 0.1% |
| 75 | 4 | |
| 74 | 5 | |
| 73 | 7 | |
| 72 | 4 | |
| 71 | 7 | |
| 70 | 9 |
VIP
Boolean
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 203 |
| Missing (%) | 2.3% |
| Memory size | 393.9 KiB |
| False | |
|---|---|
| True | 199 |
| (Missing) | 203 |
| Value | Count | Frequency (%) |
| False | 8291 | |
| True | 199 | 2.3% |
| (Missing) | 203 | 2.3% |
RoomService
Real number (ℝ)
ZEROS 
| Distinct | 1273 |
|---|---|
| Distinct (%) | 14.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 220.00932 |
| Minimum | 0 |
|---|---|
| Maximum | 14327 |
| Zeros | 5758 |
| Zeros (%) | 66.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 393.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 41 |
| 95-th percentile | 1256.8 |
| Maximum | 14327 |
| Range | 14327 |
| Interquartile range (IQR) | 41 |
Descriptive statistics
| Standard deviation | 660.51905 |
|---|---|
| Coefficient of variation (CV) | 3.0022322 |
| Kurtosis | 66.577452 |
| Mean | 220.00932 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.3977659 |
| Sum | 1912541 |
| Variance | 436285.42 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5758 | |
| 1 | 117 | 1.3% |
| 2 | 79 | 0.9% |
| 3 | 61 | 0.7% |
| 4 | 47 | 0.5% |
| 5 | 28 | 0.3% |
| 9 | 25 | 0.3% |
| 8 | 24 | 0.3% |
| 6 | 24 | 0.3% |
| 14 | 21 | 0.2% |
| Other values (1263) | 2509 |
| Value | Count | Frequency (%) |
| 0 | 5758 | |
| 1 | 117 | 1.3% |
| 2 | 79 | 0.9% |
| 3 | 61 | 0.7% |
| 4 | 47 | 0.5% |
| 5 | 28 | 0.3% |
| 6 | 24 | 0.3% |
| 7 | 17 | 0.2% |
| 8 | 24 | 0.3% |
| 9 | 25 | 0.3% |
| Value | Count | Frequency (%) |
| 14327 | 1 | |
| 9920 | 1 | |
| 8586 | 1 | |
| 8243 | 1 | |
| 8209 | 1 | |
| 8168 | 1 | |
| 8151 | 1 | |
| 8142 | 1 | |
| 8030 | 1 | |
| 7406 | 1 |
FoodCourt
Real number (ℝ)
ZEROS 
| Distinct | 1507 |
|---|---|
| Distinct (%) | 17.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 448.43403 |
| Minimum | 0 |
|---|---|
| Maximum | 29813 |
| Zeros | 5639 |
| Zeros (%) | 64.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 393.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 61 |
| 95-th percentile | 2669.4 |
| Maximum | 29813 |
| Range | 29813 |
| Interquartile range (IQR) | 61 |
Descriptive statistics
| Standard deviation | 1595.7906 |
|---|---|
| Coefficient of variation (CV) | 3.5585851 |
| Kurtosis | 74.856189 |
| Mean | 448.43403 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.1775152 |
| Sum | 3898237 |
| Variance | 2546547.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5639 | |
| 1 | 116 | 1.3% |
| 2 | 75 | 0.9% |
| 3 | 53 | 0.6% |
| 4 | 53 | 0.6% |
| 5 | 33 | 0.4% |
| 6 | 31 | 0.4% |
| 9 | 28 | 0.3% |
| 7 | 27 | 0.3% |
| 10 | 27 | 0.3% |
| Other values (1497) | 2611 |
| Value | Count | Frequency (%) |
| 0 | 5639 | |
| 1 | 116 | 1.3% |
| 2 | 75 | 0.9% |
| 3 | 53 | 0.6% |
| 4 | 53 | 0.6% |
| 5 | 33 | 0.4% |
| 6 | 31 | 0.4% |
| 7 | 27 | 0.3% |
| 8 | 20 | 0.2% |
| 9 | 28 | 0.3% |
| Value | Count | Frequency (%) |
| 29813 | 1 | |
| 27723 | 1 | |
| 27071 | 1 | |
| 26830 | 1 | |
| 21066 | 1 | |
| 18481 | 1 | |
| 17958 | 1 | |
| 17901 | 1 | |
| 17687 | 1 | |
| 17432 | 1 |
ShoppingMall
Real number (ℝ)
ZEROS 
| Distinct | 1115 |
|---|---|
| Distinct (%) | 12.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 169.5723 |
| Minimum | 0 |
|---|---|
| Maximum | 23492 |
| Zeros | 5795 |
| Zeros (%) | 66.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 393.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 22 |
| 95-th percentile | 912.4 |
| Maximum | 23492 |
| Range | 23492 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 598.00716 |
|---|---|
| Coefficient of variation (CV) | 3.5265616 |
| Kurtosis | 336.01735 |
| Mean | 169.5723 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.763842 |
| Sum | 1474092 |
| Variance | 357612.57 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5795 | |
| 1 | 153 | 1.8% |
| 2 | 80 | 0.9% |
| 3 | 59 | 0.7% |
| 4 | 45 | 0.5% |
| 5 | 38 | 0.4% |
| 7 | 36 | 0.4% |
| 6 | 34 | 0.4% |
| 13 | 29 | 0.3% |
| 9 | 28 | 0.3% |
| Other values (1105) | 2396 |
| Value | Count | Frequency (%) |
| 0 | 5795 | |
| 1 | 153 | 1.8% |
| 2 | 80 | 0.9% |
| 3 | 59 | 0.7% |
| 4 | 45 | 0.5% |
| 5 | 38 | 0.4% |
| 6 | 34 | 0.4% |
| 7 | 36 | 0.4% |
| 8 | 28 | 0.3% |
| 9 | 28 | 0.3% |
| Value | Count | Frequency (%) |
| 23492 | 1 | |
| 12253 | 1 | |
| 10705 | 1 | |
| 10424 | 1 | |
| 9058 | 1 | |
| 7810 | 1 | |
| 7185 | 1 | |
| 7148 | 1 | |
| 7104 | 1 | |
| 6805 | 1 |
Spa
Real number (ℝ)
ZEROS 
| Distinct | 1327 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 304.58886 |
| Minimum | 0 |
|---|---|
| Maximum | 22408 |
| Zeros | 5507 |
| Zeros (%) | 63.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 393.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 53 |
| 95-th percentile | 1575.2 |
| Maximum | 22408 |
| Range | 22408 |
| Interquartile range (IQR) | 53 |
Descriptive statistics
| Standard deviation | 1125.5626 |
|---|---|
| Coefficient of variation (CV) | 3.6953503 |
| Kurtosis | 82.920686 |
| Mean | 304.58886 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.7164496 |
| Sum | 2647791 |
| Variance | 1266891.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5507 | |
| 1 | 146 | 1.7% |
| 2 | 105 | 1.2% |
| 5 | 53 | 0.6% |
| 3 | 53 | 0.6% |
| 4 | 46 | 0.5% |
| 7 | 34 | 0.4% |
| 6 | 33 | 0.4% |
| 9 | 28 | 0.3% |
| 8 | 28 | 0.3% |
| Other values (1317) | 2660 |
| Value | Count | Frequency (%) |
| 0 | 5507 | |
| 1 | 146 | 1.7% |
| 2 | 105 | 1.2% |
| 3 | 53 | 0.6% |
| 4 | 46 | 0.5% |
| 5 | 53 | 0.6% |
| 6 | 33 | 0.4% |
| 7 | 34 | 0.4% |
| 8 | 28 | 0.3% |
| 9 | 28 | 0.3% |
| Value | Count | Frequency (%) |
| 22408 | 1 | |
| 18572 | 1 | |
| 16594 | 1 | |
| 16139 | 1 | |
| 15586 | 1 | |
| 15331 | 1 | |
| 15238 | 1 | |
| 14970 | 1 | |
| 13995 | 1 | |
| 13902 | 1 |
VRDeck
Real number (ℝ)
ZEROS 
| Distinct | 1306 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 298.26182 |
| Minimum | 0 |
|---|---|
| Maximum | 24133 |
| Zeros | 5683 |
| Zeros (%) | 65.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 393.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 40 |
| 95-th percentile | 1480.2 |
| Maximum | 24133 |
| Range | 24133 |
| Interquartile range (IQR) | 40 |
Descriptive statistics
| Standard deviation | 1134.1264 |
|---|---|
| Coefficient of variation (CV) | 3.8024525 |
| Kurtosis | 87.883437 |
| Mean | 298.26182 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.9045544 |
| Sum | 2592790 |
| Variance | 1286242.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5683 | |
| 1 | 139 | 1.6% |
| 2 | 70 | 0.8% |
| 3 | 56 | 0.6% |
| 5 | 51 | 0.6% |
| 4 | 47 | 0.5% |
| 6 | 32 | 0.4% |
| 8 | 30 | 0.3% |
| 7 | 29 | 0.3% |
| 9 | 25 | 0.3% |
| Other values (1296) | 2531 |
| Value | Count | Frequency (%) |
| 0 | 5683 | |
| 1 | 139 | 1.6% |
| 2 | 70 | 0.8% |
| 3 | 56 | 0.6% |
| 4 | 47 | 0.5% |
| 5 | 51 | 0.6% |
| 6 | 32 | 0.4% |
| 7 | 29 | 0.3% |
| 8 | 30 | 0.3% |
| 9 | 25 | 0.3% |
| Value | Count | Frequency (%) |
| 24133 | 1 | |
| 20336 | 1 | |
| 17306 | 1 | |
| 17074 | 1 | |
| 16337 | 1 | |
| 14485 | 1 | |
| 12708 | 1 | |
| 12685 | 1 | |
| 12682 | 1 | |
| 12424 | 1 |
Transported
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 334.4 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 4378 | |
| False | 4315 |
Passenger_Group
Real number (ℝ)
| Distinct | 6217 |
|---|---|
| Distinct (%) | 71.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4633.3896 |
| Minimum | 1 |
|---|---|
| Maximum | 9280 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 393.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 465.6 |
| Q1 | 2319 |
| median | 4630 |
| Q3 | 6883 |
| 95-th percentile | 8819.4 |
| Maximum | 9280 |
| Range | 9279 |
| Interquartile range (IQR) | 4564 |
Descriptive statistics
| Standard deviation | 2671.0289 |
|---|---|
| Coefficient of variation (CV) | 0.57647404 |
| Kurtosis | -1.1817463 |
| Mean | 4633.3896 |
| Median Absolute Deviation (MAD) | 2277 |
| Skewness | 0.0020202219 |
| Sum | 40278056 |
| Variance | 7134395.1 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 4498 | 8 | 0.1% |
| 8168 | 8 | 0.1% |
| 8728 | 8 | 0.1% |
| 8796 | 8 | 0.1% |
| 8956 | 8 | 0.1% |
| 4256 | 8 | 0.1% |
| 984 | 8 | 0.1% |
| 9081 | 8 | 0.1% |
| 8988 | 8 | 0.1% |
| 5756 | 8 | 0.1% |
| Other values (6207) | 8613 |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 2 | |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 2 | |
| 7 | 1 | < 0.1% |
| 8 | 3 | |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9280 | 2 | |
| 9279 | 1 | < 0.1% |
| 9278 | 1 | < 0.1% |
| 9276 | 1 | < 0.1% |
| 9275 | 3 | |
| 9274 | 1 | < 0.1% |
| 9272 | 2 | |
| 9270 | 1 | < 0.1% |
| 9268 | 1 | < 0.1% |
| 9267 | 2 |
Cabin_Deck
Categorical
MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 199 |
| Missing (%) | 2.3% |
| Memory size | 393.9 KiB |
| F | |
|---|---|
| G | |
| E | |
| B | |
| C | |
| Other values (3) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8494 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | F |
| 3rd row | A |
| 4th row | A |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
| F | 2794 | |
| G | 2559 | |
| E | 876 | 10.1% |
| B | 779 | 9.0% |
| C | 747 | 8.6% |
| D | 478 | 5.5% |
| A | 256 | 2.9% |
| T | 5 | 0.1% |
| (Missing) | 199 | 2.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| f | 2794 | |
| g | 2559 | |
| e | 876 | 10.3% |
| b | 779 | 9.2% |
| c | 747 | 8.8% |
| d | 478 | 5.6% |
| a | 256 | 3.0% |
| t | 5 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 2794 | |
| G | 2559 | |
| E | 876 | 10.3% |
| B | 779 | 9.2% |
| C | 747 | 8.8% |
| D | 478 | 5.6% |
| A | 256 | 3.0% |
| T | 5 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8494 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 2794 | |
| G | 2559 | |
| E | 876 | 10.3% |
| B | 779 | 9.2% |
| C | 747 | 8.8% |
| D | 478 | 5.6% |
| A | 256 | 3.0% |
| T | 5 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8494 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 2794 | |
| G | 2559 | |
| E | 876 | 10.3% |
| B | 779 | 9.2% |
| C | 747 | 8.8% |
| D | 478 | 5.6% |
| A | 256 | 3.0% |
| T | 5 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8494 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| F | 2794 | |
| G | 2559 | |
| E | 876 | 10.3% |
| B | 779 | 9.2% |
| C | 747 | 8.8% |
| D | 478 | 5.6% |
| A | 256 | 3.0% |
| T | 5 | 0.1% |
Cabin_Number
Real number (ℝ)
MISSING 
| Distinct | 1817 |
|---|---|
| Distinct (%) | 21.4% |
| Missing | 199 |
| Missing (%) | 2.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 600.36767 |
| Minimum | 0 |
|---|---|
| Maximum | 1894 |
| Zeros | 18 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 393.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 31 |
| Q1 | 167.25 |
| median | 427 |
| Q3 | 999 |
| 95-th percentile | 1569.35 |
| Maximum | 1894 |
| Range | 1894 |
| Interquartile range (IQR) | 831.75 |
Descriptive statistics
| Standard deviation | 511.86723 |
|---|---|
| Coefficient of variation (CV) | 0.85258959 |
| Kurtosis | -0.71277235 |
| Mean | 600.36767 |
| Median Absolute Deviation (MAD) | 329 |
| Skewness | 0.71835962 |
| Sum | 5099523 |
| Variance | 262008.06 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 82 | 28 | 0.3% |
| 86 | 22 | 0.3% |
| 19 | 22 | 0.3% |
| 56 | 21 | 0.2% |
| 176 | 21 | 0.2% |
| 97 | 21 | 0.2% |
| 230 | 20 | 0.2% |
| 269 | 19 | 0.2% |
| 65 | 19 | 0.2% |
| 123 | 19 | 0.2% |
| Other values (1807) | 8282 | |
| (Missing) | 199 | 2.3% |
| Value | Count | Frequency (%) |
| 0 | 18 | |
| 1 | 15 | |
| 2 | 11 | |
| 3 | 16 | |
| 4 | 7 | 0.1% |
| 5 | 13 | |
| 6 | 12 | |
| 7 | 9 | |
| 8 | 13 | |
| 9 | 16 |
| Value | Count | Frequency (%) |
| 1894 | 1 | |
| 1893 | 1 | |
| 1892 | 1 | |
| 1891 | 1 | |
| 1888 | 2 | |
| 1886 | 1 | |
| 1884 | 1 | |
| 1880 | 1 | |
| 1878 | 1 | |
| 1877 | 1 |
Cabin_Side
Categorical
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 199 |
| Missing (%) | 2.3% |
| Memory size | 393.9 KiB |
| S | |
|---|---|
| P |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8494 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | P |
|---|---|
| 2nd row | S |
| 3rd row | S |
| 4th row | S |
| 5th row | S |
Common Values
| Value | Count | Frequency (%) |
| S | 4288 | |
| P | 4206 | |
| (Missing) | 199 | 2.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| s | 4288 | |
| p | 4206 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 4288 | |
| P | 4206 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8494 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 4288 | |
| P | 4206 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8494 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 4288 | |
| P | 4206 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8494 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 4288 | |
| P | 4206 |
Total_Spend
Real number (ℝ)
ZEROS 
| Distinct | 2336 |
|---|---|
| Distinct (%) | 26.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1440.8663 |
| Minimum | 0 |
|---|---|
| Maximum | 35987 |
| Zeros | 3653 |
| Zeros (%) | 42.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 393.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 716 |
| Q3 | 1441 |
| 95-th percentile | 6457.6 |
| Maximum | 35987 |
| Range | 35987 |
| Interquartile range (IQR) | 1441 |
Descriptive statistics
| Standard deviation | 2803.0457 |
|---|---|
| Coefficient of variation (CV) | 1.9453891 |
| Kurtosis | 27.478447 |
| Mean | 1440.8663 |
| Median Absolute Deviation (MAD) | 716 |
| Skewness | 4.4175882 |
| Sum | 12525451 |
| Variance | 7857065.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3653 | |
| 809 | 54 | 0.6% |
| 788 | 40 | 0.5% |
| 804 | 39 | 0.4% |
| 803 | 34 | 0.4% |
| 907 | 32 | 0.4% |
| 908 | 32 | 0.4% |
| 791 | 30 | 0.3% |
| 888 | 29 | 0.3% |
| 716 | 27 | 0.3% |
| Other values (2326) | 4723 |
| Value | Count | Frequency (%) |
| 0 | 3653 | |
| 1 | 2 | < 0.1% |
| 2 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 10 | 3 | < 0.1% |
| 11 | 2 | < 0.1% |
| 17 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 35987 | 1 | |
| 31076 | 1 | |
| 31074 | 1 | |
| 30478 | 1 | |
| 29608 | 1 | |
| 28074 | 1 | |
| 27848 | 1 | |
| 27842 | 1 | |
| 27650 | 1 | |
| 27428 | 1 |
| HomePlanet | CryoSleep | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Transported | Passenger_Group | Cabin_Deck | Cabin_Number | Cabin_Side | Total_Spend | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PassengerId | ||||||||||||||||
| 0001_01 | Europa | False | TRAPPIST-1e | 39.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | False | 0001 | B | 0 | P | 0.0 |
| 0002_01 | Earth | False | TRAPPIST-1e | 24.0 | False | 109.0 | 9.0 | 25.0 | 549.0 | 44.0 | True | 0002 | F | 0 | S | 736.0 |
| 0003_01 | Europa | False | TRAPPIST-1e | 58.0 | True | 43.0 | 3576.0 | 0.0 | 6715.0 | 49.0 | False | 0003 | A | 0 | S | 10383.0 |
| 0003_02 | Europa | False | TRAPPIST-1e | 33.0 | False | 0.0 | 1283.0 | 371.0 | 3329.0 | 193.0 | False | 0003 | A | 0 | S | 5176.0 |
| 0004_01 | Earth | False | TRAPPIST-1e | 16.0 | False | 303.0 | 70.0 | 151.0 | 565.0 | 2.0 | True | 0004 | F | 1 | S | 1091.0 |
| 0005_01 | Earth | False | PSO J318.5-22 | 44.0 | False | 0.0 | 483.0 | 0.0 | 291.0 | 0.0 | True | 0005 | F | 0 | P | 774.0 |
| 0006_01 | Earth | False | TRAPPIST-1e | 26.0 | False | 42.0 | 1539.0 | 3.0 | 0.0 | 0.0 | True | 0006 | F | 2 | S | 1584.0 |
| 0006_02 | Earth | True | TRAPPIST-1e | 28.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 0006 | G | 0 | S | 0.0 |
| 0007_01 | Earth | False | TRAPPIST-1e | 35.0 | False | 0.0 | 785.0 | 17.0 | 216.0 | 0.0 | True | 0007 | F | 3 | S | 1018.0 |
| 0008_01 | Europa | True | 55 Cancri e | 14.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 0008 | B | 1 | P | 0.0 |
| HomePlanet | CryoSleep | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Transported | Passenger_Group | Cabin_Deck | Cabin_Number | Cabin_Side | Total_Spend | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PassengerId | ||||||||||||||||
| 9272_02 | Earth | False | TRAPPIST-1e | 21.0 | False | 86.0 | 3.0 | 149.0 | 208.0 | 329.0 | False | 9272 | F | 1894 | P | 775.0 |
| 9274_01 | NaN | True | TRAPPIST-1e | 23.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 9274 | G | 1508 | P | 0.0 |
| 9275_01 | Europa | False | TRAPPIST-1e | 0.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 9275 | A | 97 | P | 0.0 |
| 9275_02 | Europa | False | TRAPPIST-1e | 32.0 | False | 1.0 | 1146.0 | 0.0 | 50.0 | 34.0 | False | 9275 | A | 97 | P | 1231.0 |
| 9275_03 | Europa | NaN | TRAPPIST-1e | 30.0 | False | 0.0 | 3208.0 | 0.0 | 2.0 | 330.0 | True | 9275 | A | 97 | P | 3540.0 |
| 9276_01 | Europa | False | 55 Cancri e | 41.0 | True | 0.0 | 6819.0 | 0.0 | 1643.0 | 74.0 | False | 9276 | A | 98 | P | 8536.0 |
| 9278_01 | Earth | True | PSO J318.5-22 | 18.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | False | 9278 | G | 1499 | S | 0.0 |
| 9279_01 | Earth | False | TRAPPIST-1e | 26.0 | False | 0.0 | 0.0 | 1872.0 | 1.0 | 0.0 | True | 9279 | G | 1500 | S | 1873.0 |
| 9280_01 | Europa | False | 55 Cancri e | 32.0 | False | 0.0 | 1049.0 | 0.0 | 353.0 | 3235.0 | False | 9280 | E | 608 | S | 4637.0 |
| 9280_02 | Europa | False | TRAPPIST-1e | 44.0 | False | 126.0 | 4688.0 | 0.0 | 0.0 | 12.0 | True | 9280 | E | 608 | S | 4826.0 |
Most frequently occurring
| HomePlanet | CryoSleep | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Transported | Passenger_Group | Cabin_Deck | Cabin_Number | Cabin_Side | Total_Spend | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Earth | False | 55 Cancri e | 0.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 3476 | G | 571 | P | 0.0 | 2 |
| 1 | Earth | True | TRAPPIST-1e | 0.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | False | 6020 | G | 974 | P | 0.0 | 2 |
| 2 | Earth | True | TRAPPIST-1e | 0.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 3519 | G | 577 | P | 0.0 | 2 |
| 3 | Earth | True | TRAPPIST-1e | 2.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 4474 | G | 730 | S | 0.0 | 2 |
| 4 | Europa | True | 55 Cancri e | 18.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 0504 | B | 19 | S | 0.0 | 2 |
| 5 | Europa | True | 55 Cancri e | 30.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 0642 | C | 25 | S | 0.0 | 2 |
| 6 | Europa | True | TRAPPIST-1e | 28.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 3279 | C | 123 | S | 0.0 | 2 |
| 7 | Mars | False | TRAPPIST-1e | 1.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 8681 | F | 1787 | P | 0.0 | 2 |
| 8 | Mars | False | TRAPPIST-1e | 4.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 5142 | F | 1050 | P | 0.0 | 2 |
| 9 | Mars | True | 55 Cancri e | 20.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 2234 | F | 448 | P | 0.0 | 2 |